Leave 'em alone - why continuous variables should be analyzed as such.
Abstract
Continuous variables, be they outcomes, exposures or covariates, are common in clinical studies. They are frequently modified into categorical variables during their analysis. Pocock et al. [1] found that 84% of epidemiological articles from leading journals categorized continuous variables. Such a categorization could be done for several reasons [2]. It is commonly perceived that categorization makes it easier to report and interpret final results (‘X doubles the risk of Y’ vs. ‘The risk of Y doubles when X increases by 10 units’). Researchers may be uncomfortable assuming a linear relationship between a continuous variable and the outcome but are unfamiliar with methods of handling non-linearity. Researchers and analysts may have less experience in dealing with continuous variables and prefer to make them behave like the more familiar categorical ones. Finally, it is also possible that physicians and epidemiologists, who frequently categorize continuous measures in their routine work (hypertensive or not, dyslipidemic or not, etc.), instinctively transplant this training from the clinic or field to their analysis.

However, categorizing continuous variables can cause problems. The first is information loss. Zhao and Kolonel [3] found that analyses with categorized continuous variables required more than 40% more patients for the same power as that achieved using continuous variables. Selvin [4] derived a formula to calculate the efficiency loss due to categorizing a continuous variable. Becher et al. [5] found that models with a categorized exposure variable removed only 67% of the confounding controlled when the continuous version was used.

Categorizing continuous variables may not only miss the message, it can also get it wrong. Under some circumstances, categorizing continuous variables can give biased results. In a simulation study, Taylor and Yu [6] found that categorizing one continuous variable can artificially make another variable appear associated with the outcome. Selvin [4] showed that the cutpoint chosen during the categorization of continuous variables significantly changed the calculated odds ratio. Royston et al. [2] found that the significant association of the ‘S phase fraction’ with cancer outcomes repeatedly came and went depending on which cutpoint was used to define ‘abnormal’. Ragland [7] showed similar findings with prevalence ratios of hypertension.

Information loss and bias from categorizing continuous variables explain why statisticians frequently warn us to leave continuous variables alone [2, 8]. It appears that this advice was lost on investigators, ourselves included, who have developed risk stratification schemes for patients with atrial fibrillation (AF). Quantifying stroke risk in AF is essential for patient management: high-risk patients require oral anticoagulants, while low-risk patients (who stand to have a minimal absolute benefit from treatment) can avoid such therapy. Patient age is significantly associated with stroke

Received: February 9, 2008. Accepted: February 9, 2008. Published online: April 17, 2008.
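The information loss from categorization can be illustrated with a small simulation. The sketch below is not taken from the article: it assumes a normally distributed exposure, a linear effect on a continuous outcome, and a split at the sample median, and the sample size, slope, and seed are arbitrary illustrative choices. It compares the correlation of the outcome with the continuous exposure against its correlation with the dichotomized version. For a normal exposure split at the median, the squared correlation is expected to shrink by a factor of about 2/π, which roughly corresponds to the relative efficiency of the dichotomized analysis.

```python
import math
import random


def pearson(xs, ys):
    """Plain Pearson correlation, written out to avoid extra dependencies."""
    n = len(xs)
    mx = sum(xs) / n
    my = sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)


def simulate(n=20000, slope=0.5, seed=1):
    """Compare a continuous exposure with its median-split version.

    n, slope and seed are illustrative assumptions, not values from the article.
    """
    rng = random.Random(seed)
    x = [rng.gauss(0.0, 1.0) for _ in range(n)]          # continuous exposure
    y = [slope * xi + rng.gauss(0.0, 1.0) for xi in x]   # continuous outcome
    median = sorted(x)[n // 2]
    x_split = [1.0 if xi > median else 0.0 for xi in x]  # "high" vs "low"
    return pearson(x, y), pearson(x_split, y)


r_cont, r_split = simulate()
# Share of explained variance that survives the median split.
retained = (r_split / r_cont) ** 2
print(f"continuous r = {r_cont:.3f}, median-split r = {r_split:.3f}")
print(f"share of explained variance retained = {retained:.2f}")
print(f"theoretical value for a normal exposure = {2 / math.pi:.2f}")
```

Under these assumptions the split keeps only about two thirds of the explained variance, so a dichotomized analysis would need a correspondingly larger sample to reach the same power. Cutpoints other than the median generally lose more.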
Journal: Neuroepidemiology
Volume 30, Issue 3
Publication date: 2008